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ORIGINAL ARTICLE 

Methods for Predicting Diabetes Phase III Efficacy 
Outcome From Early Data: Superior Performance 
Obtained Using Longitudinal Approaches 

JB m\\er\ NR Kristensen\ S K\\m\ MO Karlsson^ SH Ingwersen^ and MC Kjellsson^ 

The link between glucose and HbA1c at steady state has previously been described using steady-state or longitudinal 
relationships. We evaluated five published methods for prediction of HbA1c after 26/28 weeks using data from four clinical 
trials. Methods (1) and (2): steady-state regression of HbA1c on fasting plasma glucose and mean plasma glucose, respectively, 
(3) an indirect response model of fasting plasma glucose effects on HbA1 c, (4) model of glycosylation of red blood cells, and (5) 
coupled indirect response model for mean plasma glucose and HbA1c. Absolute mean prediction errors were 0.61, 0.38, 0.55, 
0.37, and 0.15% points, respectively, for Methods 1 through 5. This indicates that predictions improved by using mean plasma 
glucose instead of fasting plasma glucose, by inclusion of longitudinal glucose data and further by inclusion of longitudinal 
HbAlc data until 12 weeks. For prediction of trial outcome, the longitudinal models based on mean plasma glucose (Methods 4 
and 5) had substantially better performance compared with the other methods. 
CPT Pharmacometrics Syst. Pharmacol. (2014) 3, e122; doi:10.1038/psp.2014.20; published online 2 July 2014 



In 2012, it was estimated that the total cost of diagnosed 
diabetes in United States alone was $245 billion J In order to 
lower such expenses, there is a clear need for development 
of cost-effective and safe antidiabetic medicine. At the same 
time, pharmaceutical companies are struggling to improve 
R&D productivity^ and to fulfill the needs of the growing 
patient population.^ Key drivers for increased productivity are 
correct decision making with regard to early selection of can- 
didates and doses — and improved decision making is likely 
to lead to lower attrition rate in phase II and 1!!.^ 

In development of antidiabetic medicine, the use of differ- 
ent biomarkers at each development stage" represents one of 
the main challenges for taking such decisions in drug devel- 
opment. Early clinical studies are typically of short duration, 
and efficacy is measured using biomarkers that reach steady 
state within a fairly short timeframe. Dependent on mode of 
action, efficacy is either assessed using glucose infusion 
rates from a glucose clamp study, glucose excursions fol- 
lowing glucose tolerance tests, or simply by measurements 
of plasma glucose (typically fasting plasma glucose (FPG)). 
Clamp tests and glucose tolerance tests often need to be 
performed in an inpatient setting, whereas plasma glucose 
measurements can also be performed at home (self-mea- 
sured plasma glucose (SMPG)), providing the opportunity of 
frequent assessment of treatment effect. Clearly, both FPG 
values analyzed in a laboratory, and SMPG values can be 
biased and hard to interpret due to various factors such as 
inconsistency in reporting concentration in blood or plasma 
values and day-to-day variation.^ '' 

Although debated, the predictive power of HbAlc for devel- 
opment of long-term complications of diabetes is indeed 
strong.^" Based on this, steady-state values of HbAlc have 
been chosen as the main efficacy end point for confirmatory 



phase III trials. In contrast, early clinical trials are often of 
short duration, not allowing HbAlc to reach steady state, and 
hence, values of glucose at steady state are often used as 
the main efficacy end point. Thus, in order to predict efficacy 
outcomes in confirmatory trials (based on early trial data), a 
translation from values of posttreatment steady-state glucose 
to posttreatment steady-state HbAlc is needed. 

The correlation between HbAlc and FPG has previously 
been established in several studies. "'^^ Similar relations have 
been proposed for the relationship between HbAI c and mean 
plasma glucose (MPG) calculated from 24-h glucose profiles 
based on either 7- to 11 -point SIVIPG profiles or continu- 
ous measurements.^^'" Measuring FPG is less of a burden 
from a patient's point of view. The drawback of using FPG for 
efficacy measurement is that the effect of drugs that mainly 
target the postprandial glucose (PPG) might not be accu- 
rately captured. As HbAlc is a function of overall glycemic 
exposure, ^^^'^ one would expect more accurate predictions of 
HbAlc based on MPG values compared with FPG values. 

Although steady-state relations between glucose and 
HbAlc have been calculated on large populations of sub- 
jects, they have generally not been tested and compared 
for prediction of clinical trial outcome across different types 
of treatments. Also, due to their inherent structure, they do 
not take into account the physiological and longitudinal rela- 
tions between glucose and HbAlc when subjects are under 
investigation in a clinical trial. Thus, new methods have been 
proposed that incorporate physiologic knowledge relating 
glucose measures to HbAlc and enable longitudinal descrip- 
tions of HbAI 0.1^-1" 

A direct comparison of methods for prediction of confirma- 
tory trial outcome based on early trial data can both provide 
guidance on which methods to select for prediction of HbAlc 
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and inform trial design witli regard to the clioice between 
FPG and 7-to 1 1 -point SMPG profiles and the choice of sam- 
pling schedule. To our knowledge, no extensive comparison 
of methods for predicting HbAlc across trials and treatments 
has been published. In this study, we evaluated the possibility 
of predicting phase III efficacy outcomes using different data 
sources (FPG, MPG, and early HbAlc). 

RESULTS 

An overview of trials included in this study is presented in 
Table 1 . With this data, we evaluated five different methods 
(Table 2) for prediction of steady-state HbAlc (here defined 
as HbAlc at 26/28 weeks in a clinical trial) based on data up 
to 12 weeks of treatment. In order to both evaluate the impact 
of using FPG vs. MPG and steady-state vs. longitudinal data, 
two steady-state methods were applied. Method 1 applies 
the steady-state relation between FPG and HbAlc previously 
derived by Samtani.^^ Method 2 is based on the steady-state 
relation between MPG and HbAlc previously published by 
Nathan et al.'"^ 

For incorporation of longitudinal FPG and MPG data, the 
models proposed by Samtani (Method 3)^^ and Lledo-Gar- 
cia et al. (Method 4)^" were applied. Method 3 is an indirect 
response model linking FPG to HbAlc, whereas Method 4 
is based on the relation between MPG and the lifespan and 



glycosylation of red blood cells (RBCs). Method 5 consists of 
two linked indirect response models in which predictions of 
MPG are related to predictions of HbAlc. Thus, only Method 
5 used both MPG and HbAlc as dependent variables. 
For each longitudinal model, all data up to 12 weeks were 
included. Thus, Method 3 included FPG and HbAlc data up 
to 12 weeks. Method 4 used MPG data up to 12 weeks, and 
Method 5 used MPG and HbAlc data up to 12 weeks. All 
methods were applied for prediction of all 12 treatment arms 
evaluated in the four trials as shown in Table i.2°-23 

Mean prediction errors at 26/28 weeks are presented for 
each treatment arm in Figure 1. Individual predictions from 
each of the five methods were obtained from 12-week data 
in line with the description for each method (see Methods 
section). All methods underestimated the observed values to 
various degrees. In particular, the application of FPG values 
(Methods 1 and 3) seemed to cause predictions to be much 
lower than the observed values. Numerical values of mean 
absolute prediction errors are presented in Table 3 for each 
treatment arm for the five different methods. For Method 5, the 
overall mean prediction error was -0. 1 3% points, whereas for 
the other methods, it ranged from -0.37 to -0.61% points. As 
seen, the regression-based method using FPG (Method 1) 
significantly underestimated HbAlc at end of trial for basal 
insulins (arms 1, 2, and 4). The predictions from the MPG- 
based methods (Methods 2 and 4) were more consistent 



Table 1 Summary of studies applied for comparison of steady-state HbAlc prediction methods 









HbAlc at 


Fasting plasma 


24-h glucose 


HbAlc 


Steady-State 








baseline 


glucose 


profiles 


measures 


HbAlc 


Study 


Arm 


n 


(%) 


(weel<s) 


(weel<s) 


(weel<s) 


evaluated at 


Rosenstook et al.'" 


Insulin glargine (comparator) 


211 


8.5 


0, 1, 2, 3, 4, 6, 8, 


0, 12 


0, 12 


28 weeks 




Insulin detemir 


198 


8.6 


10, 12 








Raskin et a/.^' 


Biphasic insulin aspart 


95 


9.5 


0-12 


0, 12 


0, 12 


28 weeks 




Insulin glargine (comparator) 


104 


9.7 










Zinman et al.^ 


Liraglutide 1 .8mg + metformin + rosiglitazone 


134 


8.6 


0, 2, 8, 12 


0, 8, 12 


0, 8, 12 


26 weeks 




Metformin + rosiglitazone (comparator) 


123 


8.3 












Liraglutide 1 .2mg + metformin + rosiglitazone 


145 


8.5 










Nauck et al}^ 


Liraglutide 0.6 mg + metformin 


182 


8.3 


0, 2, 4, 8, 12 


0, 8, 12 


0, 8, 12 


26 weeks 




Liraglutide 1 .8mg + metformin 


168 


8.4 












Glimeperide + metformin (comparator) 


191 


8.4 












Metformin 


78 


8.3 












Liraglutide 1 .2mg + metformin 


176 


8.3 











Table 2 Summary of five methods compared for prediction of HbAlc at steady state 


iVIethod number (study) 


independent variables (data source) 


Dependent variables (prediction) 


Model description 


Method 1 (Samtani") 


FPG: samples taken at 12 weeks 


Steady-state HbAlc 


Linear regression model 


Method 2 (Nathan et al.") 


MPG: 12-week SMPG profile 


Steady-state HbAlc 


Linear regression model 


Method 3 (Samtani") 


FPG: baseline to 12-week samples 


Longitudinal predictions of HbAlc 


Indirect response model for HbAlc 


Method 4 (Lledo-Garcia et al.") 


MPG: baseline to 12-week SMPG profiles 


Longitudinal predictions of HbAlc 


Simulation model based on 
glucosylation of red blood cells 


Method 5 (Moller etal.") 


MPG: baseline to 12-week SMPG profiles 
HbAlc: screening to 12-week samples 


Longitudinal predictions of MPG and 
HbAlc 


Coupled indirect response model for 
MPG and HbAlc 



See detailed description of each method in Methods section. 

FPG, fasting plasma glucose; MPG, mean plasma glucose; SMPG, self-measured plasma glucose. 
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but still underestimated HbAlc by 0.3-0.4% points. Only 
Metliod 5 overestimated HbAlc for some treatments (arms 
5, 7, and 11). Precision was substantially higher for the three 
methods using MPG (Methods 2, 4, and 5) than for the two 
methods using FPG (IVIethods 1 and 3). When evaluating the 
model performance by means of the root mean square error 
(RMSE), a measure of individual prediction errors (Table 3), 
the above conclusions were largely confirmed: the steady- 
state methods (Methods 1 and 2) provided the largest RMSE 



o -0.5 




Figure 1 Mean prediction performance per arm of each method for 
HbAlc prediction at end of trial (26/28 weeks). Predictions were 
obtained using 1 2-week data and prediction errors were expressed 
as the deviation between the mean of individual predictions and 
the mean of the observations per arm. Arm 1: Insulin glargine, 
Arm 2: Insulin detemir, Arm 3: Biphasic insulin aspart, Arm 4: 
Insulin glargine, Arm 5: Liraglutide 1.8mg -i- metformin, Arm 6: 
Metformin + rosiglitazone. Arm 7: Liraglutide 1.2mg -i- metformin 
+ rosiglitazone, Arm 8: Liraglutide 0.6 mg + metformin, Arm 9: 
Liraglutide 1 .8 mg + metformin, Arm 1 0: Glimepiride -i- metformin. 
Arm 11: Metformin, Arm 12: Liraglutide 1.2mg -i- metformin. Key 
to symbols: Method 1 (filled blue diamonds), Method 2 (filled red 
squares)," Method 3 (open blue diamonds),'^ Method 4 (open red 
triangles),^" Method 5 (filled green triangles)." 



with overall larger errors of predictions based on FPG than 
those based on MPG. Furthermore, the RMSE was consider- 
ably lower for Method 5 than for the other four methods. 

Figure 2 presents the prediction errors in terms of treat- 
ment contrasts between study drugs and comparators 
(AAHbAlc).The AAHbAlc in each panel of Figure 2 was cal- 
culated by subtracting the baseline-corrected mean of HbAlc 
at 26/28 weeks of the comparator arms (AHbAlc^^^p^^^j^^) 
from the baseline-corrected mean of HbAlc at 26/28 weeks 
of the study drug arm (AHbAICj,^^^ ^^^^g) for observed data or 
for predictions obtained using the five methods. The confi- 
dence intervals of AAHbAlc were obtained by comparing 
the distribution of baseline-corrected HbAlc of treatment 
and comparator, using f-test. Lines through 0 and -0.3% 
are inserted to indicate no difference and the noninferior- 
ity limits for comparison against a comparator defined in 
regulatory guidance. In line with the results presented in 
Figure 1 , the steady-state methods more frequently provided 
an incorrect conclusion compared with the longitudinal mod- 
els. Additionally, the longitudinal model based on MPG obser- 
vations (Method 4) and the model based on MPG observations 
in combination with early HbAlc observations (Method 5) 
provided the correct conclusion for all comparisons. The 
longitudinal approach using FPG (Method 3) failed to cor- 
rectly predict trial outcome in several of the comparisons. 

DISCUSSION 

We compared various methods for predicting steady-state 
HbAlc outcome (defined as HbAlc at 26/28 weeks in a 
clinical trial). In order to evaluate both the impact of the type 
of data used (MPG vs. FPG) and the method (steady-state 
vs. longitudinal models), classical regression techniques 
linking glucose to HbAlc at steady state and model-based 
longitudinal approaches were applied for prediction. The 
performance of each method was evaluated based on its 
ability to predict mean HbAlc at 26/28 weeks in each arm 



Table 3 Mean prediction errors and RMSE of methods for predicting HbAlc treatment outcome 







Method 1 


Method 2 


Method 3 


Method 4 


Method 5 


Arm no. 


Arm 


(Samtani— SS) 


(Nathan— SS) 


(Samtani) 


(RBC) 


(Moller) 


1 


Insulin glargine (comparator) 


-1.2 (2.4) 


-0.4 (1.4) 


-0.7 (1.3) 


-0.4 


-0.1 (0.5) 


2 


Insulin detemir 


-1.1 (2.0) 


-0.4 (1.4) 


-0.8(1.6) 


-0.4 


-0.2 (0.4) 


3 


Biphasic insulin aspart 


-0.5 (1.9 


-0.4 (2.0) 


-1.0 (2.1) 


-0.6 


-0.2 (0.6) 


4 


Insulin glargine (comparator) 


-1.2 (2.6) 


-0.5 (1.7) 


-0.3 (1.7) 


-0.3 


-0.1 (0.5) 


5 


Liraglutide 1.8mg + metformin 


-0.4 (0.8) 


-0.4 (0.8) 


-0.4 (0.7) 


-0.3 


0.1 (0.2) 


6 


Metformin + rosiglitazone (comparator) 


-0.3 (0.8) 


-0.2 (1.0) 


-0.4 (0.7) 


-0.3 


0.0 (0.3) 


7 


Liraglutide 1 .2mg + metformin + rosiglitazone 


-0.4 (0.7) 


-0.4 (0.9) 


-0.4 (0.8) 


-0.2 


0.1 (0.5) 


8 


Liraglutide 0.6 mg + metformin 


-0.5(1.2) 


-0.3(1.4) 


-0.3 (0.8) 


-0.4 


-0.3 (0.4) 


9 


Liraglutide 1 .8mg + metformin 


-0.4(1.3) 


-0.4 (0.9) 


-0.6(1.1) 


-0.4 


-0.3 (0.5) 


10 


Glimiperide + metformin (comparator) 


-0.3 (1.0) 


-0.3 (1.1) 


-0.5 (1.0) 


-0.4 


-0.3 (0.6) 


11 


Metformin 


-0.5(1.0) 


-0.2 (1.4) 


-0.5 (1.0) 


-0.3 


0.0 (0.5) 


12 


Liraglutide 1 .2mg + metformin 


-0.6 (1.1) 


-0.4 (1.0) 


-0.5 (1.0) 


-0.4 


-0.3 (0.5) 


All 


Overall mean prediction error (mean RIUSE) 


-0.67 (1.4) 


-0.38 (1.3) 


-0.55 (1.2) 


-0.37 (1.2) 


-0.73(0.4; 


All 


Overall absolute mean prediction error 


0.61 


0.38 


0.55 


0.37 


0.75 



Mean prediction errors and RMSE of HbAlc (%) predictions calculated as mean of individual predictions subtracted by mean of individual observations. HbAlc 
predictions at end of trial (26-28 weeks) were based on 12-week data. Steady-state predictions (Methods 1 and 2) were obtained using FPG/MPG values at 12 
weeks only. Overall mean and absolute mean prediction errors are shown in bold. 

FPG, fasting plasma glucose; MPG, mean plasma glucose; RBC, red blood cell; RMSE, root mean square error; SS, steady state. 
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(by mean prediction error and RIVISE) and on its ability to 
predict trial outcome expressed as the difference in treatment 
effect vs. comparator. Using these two evaluation criteria, the 
methods were in turn compared both with regard to their abil- 
ity to predict the observed HbAlc value and their ability to 
predict HbA1 c change from baseline vs. comparator. For both 
criteria, the methods using MPG data provided better pre- 
dictions than the ones using FPG. This was in accordance 
with theoretical considerations,^" as the MPG — in contrast to 
FPG — also contains information of drug effects on PPG. 

The best predictive performance was obtained with a 
dynamic model describing longitudinal progression of both 
MPG and HbA1 c (Method 5). As this model uses all the obser- 
vations up to 12 weeks collected for both MPG and HbAlc, 
it is not surprising that this model has the best predictive 
performance for end-of-trial HbAlc. Moreover, this method 
accurately predicted the outcome of treatment comparisons 
obtained for all eight comparisons included. Contrary to this, 
the two steady-state methods (Methods 1 and 2) were unable 
to predict the actual outcome for four of eight comparisons. 

In the study, we implemented each prediction method as 
presented in the original papers. Thus, we used fixed param- 
eters and constants as reported in the original publications 
and explained in Methods section. Based on our results using 
the published parameter values, all methods significantly 
underestimated the final HbAlc values. 

For the steady-state methods (Methods 1 and 2), we used 
the regression parameters reported by Samtani^^ and Nathan 
et al.^^ The possibility of reestimating the parameter values 
using, e.g., week 1 2 data from each treatment arm would likely 
have resulted in biased parameter values as HbAlc, contrary 
to MGP, would not be at steady state at this time point. More- 
over, the original parameter values were estimated on larger 
data sets of a more heterogeneous nature than the clinical 
trial data investigated in this study and thus we did not find it 
appropriate to change the regression parameters for the link 
between FPG/MPG and HbAlc reported in the original papers. 

The longitudinal model describing the relation between FPG 
and HbA1 c (Method 3) was implemented according to the orig- 
inal published version of the model. In three of four trials inves- 
tigated in this work, at least one of the treatment arms involved 
drugs targeting PPG. As the longitudinal model presented by 
Samtani^^ was originally estimated on trials with drug classes 
not targeting PPG, we also investigated an implementation 
with rate constants estimated for each arm. Estimating the 
glycosylation constants per arm, as opposed to per trial as 
recommended in the original publication, allows different gly- 
cosylation rates for drugs not targeting PPG and those who do. 
This did not improve the accuracy of the predictions (results not 
shown). Estimating the parameter for the lower limit of HbAlc 
(HbAICi^^g^iJ in each trial improved the performance of the 
model significantly For the same reason as for the regression 
methods, we did not find estimating the lower limit of HbAlc 
appropriate, as this parameter was implemented as being fixed 
across trials in the original paper, and the data investigated in 
this study were not as extensive as in the original paper by 
Samtani^^ where 12 trials were used for estimation. 

The second best method (Method 4) was the longitudinal 
model published by Lledo-Garcia et al?^ This model pre- 
dicted the AAHbAlc just as well as Method 5, but predictive 



performance was slightly worse for absolute HbAlc. This 
method also uses MPG to drive the glycosylation of RBGs 
but does not require estimation of any parameters. In this 
study. Method 4 was implemented as a pure simulation 
model, in the sense that all parameters except for residual 
variability were fixed using previously published data. Alter- 
natively, the published parameters with their uncertainties 
could be used as priors for reestimation of parameters 
based on the study data. Method 4, as it was published by 
Lledo-Garcia et al., has high parameter certainty and was 
developed using data much more informative regarding the 
underlying mechanism of HbAlc formation compared with 
those included in this analysis. This led us to conclude that 
reestimation of parameters with priors would not change 
the parameters significantly. The only parameter reesti- 
mated in this setting was the residual error for HbAlc, 
which was also the most uncertain parameter according to 
Lledo-Garcia and colleagues. 

In a previous paper,^^ we used both observed MPG values 
and FPG-derived MPG values as glucose input. Here, the 
implementation of Method 5 only included observed MPG 
values. This was done to be able to compare models across 
using only MPG values and models using only FPG values 
and to show that the use of MPG values alone also provides 
accurate predictions. 

Despite the superior performance of the longitudinal 
methods based on MPG, it is worth mentioning that each 
method presented in this paper has its own merit. The 
regression-based methods can be used to obtain HbAlc 
predictions based on a single value of either FPG (Method 
1) or MPG (Method 2) at steady state. These methods 
can thus be used to obtain predictions from early phase 
studies where longitudinal values of glucose are not being 
sampled. Furthermore, these methods have their merit for 
diagnostic purpose, by translating single glucose values 
into expected HbAlc. Clearly, using FPG in a longitudinal 
approach, as compared with MPG, also needs significantly 
fewer blood samples, which is one of the advantages of the 
approach presented by Samtani (Method 3).^^ 

Various factors could account for the underestimation of 
HbAlc for each of the methods. One could be that disease 
progression is not explicitly accounted for in any of the mod- 
els. The UK Prospective Diabetes Study reports a progres- 
sion of HbAlc of 0.2% per year, which would thus cause 
predictions at 26 weeks to be 0.1% higher compared with 
baseline. Such a progression should however be imple- 
mented on the glucose component (and not the HbAlc com- 
ponent) in order to align with the physiological principle that 
the glucose change drives the HbAlc change. 

In summary, we have outlined and evaluated different meth- 
ods for HbAlc predictions based on early thai data. In line with 
results from previous studies, our results show that using MPG 
provides more accurate predictions of HbAlc compared with 
using FPG, independently of treatment. Although the amount 
of samples needed to obtain estimates of MPG are substan- 
tially higher than for FPG, these samples can be taken at home 
using self-measured glucose values as applied in the studies 
included in this paper. Furthermore, in case longitudinal HbAlc 
measurements are available, we suggest including such mea- 
surements in the prediction of long-term HbAlc outcome. 
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METHODS 

Trial design and patients 

The trials used for evaluation of predictive performance con- 
sisted of three phase III trials and one phase IV trial of previ- 
ously approved pharmaceuticals.^""^^ The data from these trials 
were previously used to validate one of the models applied in 
this paper (Method 5).^" Trials were required to include glu- 
cose profiles and HbAlc sampling at least three times up to 
end of trial, which was at either 26 or 28 weeks. Furthermore, 
in order to use similar data for all methods, each subject was 
required to have an FPG and an MPG value at 12 weeks, and 
an HbAlc sample at end of trial (26-28 weeks). IVIPG values 
were obtained from glucose profiles sampled 7-1 1 times dur- 
ing the day, calculated as k[^C^^^J2Ah. FPG was calculated 
as the mean of all available samples taken before breakfast at 
each visit. Typically, one FPG sample analyzed in the labora- 
tory, and three prebreakfast SMPG values were available at a 
given visit, in which case the final FPG value was calculated as 
the mean of these four values. 

iVIethiods used for prediction of HbAlc 

Five methods were used to perform predictions based on data 
up to 1 2 weeks. The two methods relating single measurements 
of steady-state glucose to steady-state HbAI c were evaluated 
assuming the glucose measurement at 12 weeks to be repre- 
sentative of the glucose at steady state (Methods 1 and 2)."'^^ 
In contrast, the longitudinal versions of the FPG/MPG models 
used all FPG/MPG data from baseline up to 12 weeks (Meth- 
ods 3 and 4).^^'^'* The longitudinal glucose and HbAlc model 
(Method 5)^^ used both MPG and HbAlc values up to 12 
weeks. A detailed description of each model is provided below. 
Model codes appear in the Supplementary Appendix online. 

IVIethod 1 (FPG-HbA1c steady-state solution) 

In order to compare longitudinal methods with steady-state 
solutions, we applied the original steady-state relationship 
between FPG and HbAlc derived by Samtani.^^ This relation 
is based on a model built on an extensive data set with many 
different oral antidiabetic drugs administered to type 2 diabe- 
tes patients. The following relationship between glucose and 
HbAlc was derived in the original publication and applied here 
for prediction: 

HbAlc (%) = 0.5 ■ FPG j^lH!^ j + 2.84 

The original parameter values were used for method 
comparison. 

IVIethod 2 (MPG-HbA1c steady-state solution) 

Nathan et al.'"* previously established the relation between 
average glucose and HbAlc data from 507 subjects. The 
following linear relation was derived in the original paper 
(revised to relate HbAlc to glucose and not vice versa as 
presented in the original paper) and applied here to perform 
predictions of HbAlc: 



HbAlc (%) 



0.62 MPGf 



mmol 



+1.62 



IVIethod 3 (FPG-HbA1c model) 

The longitudinal model by Samtani^^ is derived based on 
mean data from various different phase II trials. In order to 
align with the original publication, we implemented the model 
using a fixed effects approach, thus keeping all interindivid- 
ual variability parameters fixed to 0. The following equations 
describe the relation between glucose and HbAlc: 



dHbAlc 
dt 



/CgFPG 



mmol^ 



/Cd-HbAlc- 1 



HbAlc,, 



HbAlc 



Here, and are the two rate constants for HbAlc produc- 
tion and degradation, respectively, which account for the time 
delay between FPG and HbAI c. The system is initialized at the 



baseline value of HbAlc. HbAlc, 



is a physiological limit 



parameter — which enables integration across trials with differ- 
ent baseline HbAI c levels. For our method comparison, k^ and k^ 
were estimated per trial using FPG and HbAI c values week 1 2, 



whereas HbAI c,„ 



, was fixed to 2.84% as reported.' 



Method 4 (MPG-HbAlc model) 

In order to incorporate the possibility of glucose levels affect- 
ing the life span of RBCs, Lledo-Garcia et al.'"^ formulated a 
transit compartment model for description of aging of RBCs. 
The HbAlc predictions from the model are the estimated 
fraction of glycosylated to nonglycosylated RBCs — both of 
which have to be determined. The below equations describe 
the nonglycosylated part: 



LS, 



dRBO, 
dt 



TVLS-fM^Y-e^' 
I 149 J 



PRE = g-Ztg-MPGLSP, 



NC 



RBCi(0) = PRE 
dRBCg 



/c,„ • PRE - RBCi • kt, - RBC, ■ kg ■ MPG 



dt 



/(,r+/CgMPG 

(RBCi - RBCs )-kt,- RBCs ■ /(g • MPG 



RBC2(0) = RBCi(0) 
dRBCa 



dt 



/(•,r+/CgMPG 

(RBCs - RBC3) ■ /c,r - RBC3 • kg ■ MPG 



RBC3(0) = RBCs(0)- 



/c,r + /c„MPG 



dRBCis 
dt 



RBCis(0) = RBCii(0) 



(RBCi 1 - RBCis ) • /c,r - RBC12 ■ /Cg ■ MPG 

kin 



/c,r + kg ■ MPG 



These parameter values were used for method comparison. 



Here LS, represents the mean value life span for the Ah individ- 
ual, ri, is the individual deviation from the population, normally 
distributed with a mean of zero and a standard deviation of 0.29 
(as previously reported). ^''TVLS is the typical value for life span, 
which in our case was fixed to 91 .7 days as previously reported. ^'^ 
The parameter ydescribed the shape of the effect of MPG on the 
life span of RBCs. k^ is the rate of glycosylation of RBCs, fixed 
to 8.37 -10"^ as previously reported. NC is the number of transit 
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compartments (i.e., 12) from which the transit rate per 
day can be calculated (k^). PRE Is the contribution com- 
ing from the precursor pool, which is calculated using the 
glycosylation and the life span of precursor, fixed to the reported 
value of 8.2 days with a standard deviation between subjects of 
0.34.^** k„ = '\ Is the normalized production rate of RBCs. 

The structure of the coupled differential equations system 
for the glycosylated RBCs follows the one written above, but 
with the first equation In this system written as 



dRBCi3 
dt 

RBCi3(0) 

dRBCu 
dt " 

RBCi4(0) 

dRBC,5 
dt 

RBCi5(0) 

dRBC24 
dt 

RBC24(0) 



PREg = 1-PRE 

: k;„ ■ PREg - RBCi3 • k„ + RBC, kg -MPG 
/CinPREg+RBCi(0)/CgMPG 

: (RBCi3 - RBCu) ■ /c,r + RBCg ■ kg ■ MPQ 
RBCi3(0) ■ /c„ + RBC2(0) ■ /Cg ■ MPG 

: (RBCi4 - RBC15) ■ /c„ + RBC3 • /Cg ■ MPG 
RBGi4(0) ■ /c„ +RBC3(0) ■ kg ■ MPG 

: (RBC23 - RBC24) ■ kt, + RBC12 kg MPG 
RBC23(0) ■ + RBCi2(0) ■ kg ■ MPG 



In order to obtain a predicted measure of HbAlc (%), the 
relation between glycosylated RBCs and the total population 
of RBCs is calculated according to: 

HbA1 c (%) = 1 00 ■ RBCgiy / RBC,o, 

Here, RBC^i^ represents the amount of glycosylated RBCs 
obtained as the sum of compartments RBC^g to RBC24, 
whereas RBC,^,^, is the total sum of RBCs in compartments 
RBC^ to RBCJ" 

For our method comparison, we used the published 
parameter values.^** The only reestimated parameter was the 
residual error for HbAlc, which was also the most uncertain 
parameter according to Lledo-Garcia and colleagues. 

Method 5 (MPG-HbA1c model) 

This model consists of two linked indirect response mod- 
els, linking MPG dynamics to HbAlc dynamlcs.^^ The model 
has both MPG and HbAlc as dependent variables and thus 
requires longitudinal data of both MPG and HbAlc. The 
structural model is described by 

= -/CcuLMPG • (MPG - MPG,3.ge,) 

MPG (f = screening) = MPGss 



dHbAlc 
dt 



Waic-(MPG + P)-/Co„ HbAlc -HbAlc 
■ (MPGss +P) 



where /c„„, „p^, /c,„ and /c„„, are parameters char- 
acterizing the first-order turnover of MPG and HbAlc, 
respectively. MPG^^ Is a parameter representing the esti- 
mated pretreatment steady-state level, whereas MPG,g^g^, Is 
assigned different values during the course of the trial: during 
the run-in period, MPG,g,gg, Is assigned the value of MPG^^^.^ 
(MPG at randomization), and during the treatment period, it 
is assigned the value of MPGp^^,,^^^,^^^, (MPG level at full treat- 
ment effect). The time between screening and baseline can 
vary between trials. As Indicated, both MPG and HbAlc are 
assumed to be at steady state at the time of screening, but 
not necessarily at randomization. The parameter |3 relates 
MPG to HbAlc at steady state and Is estimated per subject 
using a population model. 

For method comparison, values of MPG^^ MPG,^^^^,, 
''out MPG' ^'"^ P w^''® estimated per subject using a popula- 
tion approach, k^^ ^j^^,^ and /c^^, ^^.^^^ were fixed to 0.081% per 
mmol/L per week, and 0.226 per week, respectively.^^ 



Evaluation of prediction performance 

Two different metrics were used for evaluating prediction per- 
formance. Prediction performance on the absolute HbAlc 
scale was evaluated by comparing mean predictions and 
mean observations for each method and for each treatment 
arm at 26-28 weeks, depending on study. The prediction 
error e(HbA1c).for each arm (J) was calculated according to 



e(HbA1c)y = pHbAICy -HbAICy 

where pHbAICy repres ents the mean of the Individual pre- 
dictions for arm / and HbAICy represents the mean of the 
observations for arm / Thus, a negative value indicates 
underpredictlon of HbAlc. In order to assess the potential 
bias as well as the mean deviation, summary statistics were 
derived using absolute prediction errors as well as the actual 
numerical values. In case all values have the same sign, the 
mean deviation Is thus equal to the bias. 

The RMSE was estimated for each treatment arm j as a 
summary statistic over all subjects / as: 



RMSEi 



|X;>HbA1c,-HbA1c,)^ 



HbA1c(f = screening) = 



^out_HbA1c 



where pHbAlc, and HbAlc, represent the individual predic- 
tions and observations for subject ;, respectively, and n Is the 
number of subjects In treatment arm / 

Furthermore, all methods were evaluated with respect to 
their ability to predict phase III efficacy outcome with respect 
to performance against comparator. The difference In the 
ability to lower HbAI c between trial drug and comparator was 
calculated according to the following formula: 

AAHbAlc = AHbA1c(trial drug arm) 
- AHbAI c(comparator arm) 

AHbAI c = HbAI c(at 26 - 28 weeks) - HbAI c(at baseline) 



Software 

S-PLUS, version 8.0 (TIBCO, Palo Alto, CA) was used for 
data file processing, explorative data analysis, and plotting. 



www.nature.com/psp 



Comparison of Models for HbAlc Prediction 

M0llerefal. 



NONMEM version 7.1.2 (ICON Development Solutions, Elli- 
cott City, MD) was used for model estimation and simulation. 
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Study Highlights 

WHAT IS THE CURRENT KNOWLEDGE ON THE TOPIC? 

^ Several approaches for predicting steady-state 
HbAlc based on either PPG or MPG have been 
proposed. These methods span from simple re- 
gression techniques to more mechanistic meth- 
ods including turnover of glucose and HbAlc. 

WHAT QUESTION DID THIS STUDY ADDRESS? 

y We investigated the accuracy of published re- 
gression and model-based approaches built on 
PPG or MPG for predicting HbAlc clinical trial 
outcome from early data. 

WHAT THIS STUDY ADDS TO OUR KNOWLEDGE 



In the forecast of HbAlc clinical trial outcome, 
the use of MPG values is essential for obtaining 
accurate predictions. Based on our results, we 
further propose to use a longitudinal model for 
the predictions — preferably including early HbAlc 
measurements and integrating both glucose 
and HbAlc measurements in a pharmacometric 
framework. 



HOW THIS MIGHT CHANGE CLINICAL 
PHARMACOLOGY AND THERAPEUTICS 

y Our results provide evidence for a significant 
added value by application of longitudinal phar- 
macometric approaches compared with steady- 
state regression techniques. This was shown to 
provide accurate prediction of late-stage clinical 
trial efficacy outcome based on early trial data. 
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